Multi-labeling of complex, multi-behavioral malware samples

نویسندگان

چکیده

The use of malware samples is usually required to test cyber security solutions. For that, the correct typology interest properly estimate exhibited performance tools under evaluation. Although several datasets are publicly available at present, most them not labeled or, if so, only one class or tag assigned each sample. We defend that just label enough represent usual complex behavior by current malware. With this hypothesis in mind, and based on varied classification generally provided automatic detection engines per sample, we introduce here a simple multi-labeling approach automatically multiple samples. In paper, first analyze coherence between behaviors specific number well-known dissected literature tags for our labeling proposal. After scheme executed over four public Android datasets, different results statistics obtained regarding their composition representativeness being discussed. share GitHub repository tool developed, usage.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Labeling Complicated Objects: Multi-View Multi-Instance Multi-Label Learning

Multi-Instance Multi-Label (MIML) is a learning framework where an example is associated with multiple labels and represented by a set of feature vectors (multiple instances). In the formalization of MIML learning, instances come from a single source (single view). To leverage multiple information sources (multi-view), we develop a multi-view MIML framework based on hierarchical Bayesian Networ...

متن کامل

Multi-stack Boundary Labeling Problems

The boundary labeling problem was recently introduced in [5] as a response to the problem of labeling dense point sets with large labels. In boundary labeling, we are given a rectangle R which encloses a set of n sites. Each site is associated with an axis-parallel rectangular label. The main task is to place the labels in distinct positions on the boundary of R, so that they do not overlap, an...

متن کامل

Multi-Predicate Semantic Role Labeling

The current approaches to Semantic Role Labeling (SRL) usually perform role classification for each predicate separately and the interaction among individual predicate’s role labeling is ignored if there is more than one predicate in a sentence. In this paper, we prove that different predicates in a sentence could help each other during SRL. In multi-predicate role labeling, there are mainly tw...

متن کامل

From Multi-Labeling to Multi-Domain-Labeling: A Novel Two-Dimensional Approach to Music Genre Classification

In this publication we describe a novel two-dimensional approach for automatic music genre classification. Although the subject poses a well studied task in Music Information Retrieval, some fundamental issues of genre classification have not been covered so far. Especially many modern genres are influenced by manifold musical styles. Most of all, this holds true for the broad category “World M...

متن کامل

Periodic Multi-labeling of Public Transit Lines

We designed and implemented a simple and fast heuristic for placing multiple labels along edges of a planar network. As a testbed, realworld data from Google Transit is taken: our implementation outputs an overlay onto Google Maps, adding route numbers to public transit lines.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Computers & Security

سال: 2022

ISSN: ['0167-4048', '1872-6208']

DOI: https://doi.org/10.1016/j.cose.2022.102845